Monocular Extraction of 2.1D Sketch
نویسندگان
چکیده
The 2.1D sketch is a layered representation of occluding and occluded surfaces of the scene. Extracting the 2.1D sketch from a single image is a difficult and important problem arising in many applications. We present a fast and robust algorithm that uses boundaries of image regions and T-junctions, as important visual cues about the scene structure, to estimate the scene layers. The estimation is a quadratic optimization with hinge-loss based constraints, so the 2.1D sketch is smooth in all image areas except on image contours, and image regions forming “stems” of the T-junctions correspond to occluded surfaces in the scene. Quantitative and qualitative results on challenging, real-world images—namely, Stanford depthmap and Berkeley segmentation dataset—demonstrate high accuracy, efficiency, and robustness of our approach.
منابع مشابه
A global energy optimization framework for 2.1D sketch extraction from monocular images
The 2.1D sketch is a layered image representation, which assigns a partial depth ordering of over-segmented regions in a monocular image. This paper presents a global optimization framework for inferring the 2.1D sketch from a monocular image. Our method only uses over-segmented image regions (i.e., superpixels) as input, without any information of objects in the image, since (1) segmenting obj...
متن کاملBayesian Inference for Layer Representation with Mixed Markov Random Field
This paper presents a Bayesian inference algorithm for image layer representation [26], 2.1D sketch [6], with mixed Markov random field. 2.1D sketch is an very important problem in low-middle level vision with a synthesis of two goals: segmentation and 2.5D sketch, in other words, it is to consider 2D segmentation by incorporating occulision/depth explicitly to get the partial order of final se...
متن کاملSize Matters: Metric Visual Search Constraints from Monocular Metadata
Metric constraints are known to be highly discriminative for many objects, but if training is limited to data captured from a particular 3-D sensor the quantity of training data may be severly limited. In this paper, we show how a crucial aspect of 3-D information–object and feature absolute size–can be added to models learned from commonly available online imagery, without use of any 3-D sensi...
متن کاملتاثیر هم افزایی دوچشمی بر مولفه های موج پتانسیل برانگیخته بینایی
Background : To determine the effect of binocular summation on the time domain transient VEP wave's components. Methods : The monocular and binocular transient visual evoked potentials of 21 normally vision volunteers 18 to 24 years (mean ± SD, 20.7 ± 1.9) during a reversing checkerboard stimulus with spatiotemporal frequency of 2.18-4 cpd-Hz were recorded. The amplitude and latency of N75,...
متن کاملChinese Sketch Engine and the Extraction of Grammatical Collocations
This paper introduces a new technology for collocation extraction in Chinese. Sketch Engine (Kilgarriff et al., 2004) has proven to be a very effective tool for automatic description of lexical information, including collocation extraction, based on large-scale corpus. The original work of Sketch Engine was based on BNC. We extend Sketch Engine to Chinese based on Gigaword corpus from LDC. We d...
متن کامل